Clustering construction on a multimodal probability model

نویسندگان

  • Jian Yu
  • Miin-Shen Yang
  • Pengwei Hao
چکیده

In 2009, Yu et al. proposed a multimod al probability model (MPM) for clustering. This paper makes advanced clustering constructions on the MPM. We first reconstruct most existing clustering algorithms, such as the k-means, fuzzy c-means, possibilistic c-means, mean shift, classification maximum likelihood, and latent class methods, by establishing the relationships between these clustering algorithms and the MPM. Under our clustering construction, we find that the MPM can be seen as a basic probability model for most existing clustering algorithms. We then construct new clust ering frameworks based on the MPM. One of the frameworks develops new penalized-type clustering algorithms. Another one induces entropy-type clustering algorithms, especially with sample-weighted clustering. Several numerical and real data sets are made for compariso ns. These experimental results show that our clustering constructions based on the MPM can produce useful and effective clustering algorithms. 2013 Elsevier Inc. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Multimodal Probability Model for Cluster Analysis

Cluster analysis is a tool for data analysis. It is a method for finding clusters of a data set with most similarity in the same group and most dissimilarity between different groups. In general, there are two ways, mixture distributions and classification maximum likelihood method, to use probability models for cluster analysis. However, the corresponding probability distributions to most clus...

متن کامل

An Efficient Predictive Model for Probability of Genetic Diseases Transmission Using a Combined Model

In this article, a new combined approach of a decision tree and clustering is presented to predict the transmission of genetic diseases. In this article, the performance of these algorithms is compared for more accurate prediction of disease transmission under the same condition and based on a series of measures like the positive predictive value, negative predictive value, accuracy, sensitivit...

متن کامل

Simultaneous Co-clustering and Modeling of Market Data

For difficult prediction problems, practitioners often segment the data into relatively homogenous groups and then build a model for each group. This two-step procedure usually results in simpler, more interpretable and actionable models without any loss in accuracy. We consider two important marketing problems, predicting customer-product preference and simultaneous market segmentation and str...

متن کامل

Prediction on Travel-Time Distribution for Freeways Using Online Expectation Maximization Algorithm

This paper presents a stochastic model-based approach to freeway travel-time prediction. The approach uses the Link-Node Cell Transmission Model (LN-CTM) to model traffic and provides a probability distribution for travel time. On-ramp and mainline flow profiles are collected from loop detectors, along with their uncertainties. The probability distribution is generated using Monte 5 Carlo simul...

متن کامل

A model for distribution centers location-routing problem on a multimodal transportation network with a meta-heuristic solving approach

Nowadays, organizations have to compete with different competitors in regional, national and international levels, so they have to improve their competition capabilities to survive against competitors. Undertaking activities on a global scale requires a proper distribution system which could take advantages of different transportation modes. Accordingly, the present paper addresses a location-r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Sci.

دوره 237  شماره 

صفحات  -

تاریخ انتشار 2013